Picture for Nan Wang

Nan Wang

University of California, Santa Cruz

Can Vision-Language Models Handle Long-Context Code? An Empirical Study on Visual Compression

Add code
Jan 31, 2026
Viaarxiv icon

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Add code
Jan 27, 2026
Viaarxiv icon

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding

Add code
Jan 12, 2026
Viaarxiv icon

Advanced Global Wildfire Activity Modeling with Hierarchical Graph ODE

Add code
Jan 04, 2026
Viaarxiv icon

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Add code
Dec 29, 2025
Viaarxiv icon

Fast Reconstruction of Motion-Corrupted Data with Mobile-GRAPPA: Motion and dB0 Inhomogeneity Correction Leveraging Efficient GRAPPA

Add code
Nov 09, 2025
Viaarxiv icon

SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery

Add code
Sep 09, 2025
Viaarxiv icon

One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation

Add code
Sep 09, 2025
Figure 1 for One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
Figure 2 for One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
Figure 3 for One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
Figure 4 for One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
Viaarxiv icon

zkUnlearner: A Zero-Knowledge Framework for Verifiable Unlearning with Multi-Granularity and Forgery-Resistance

Add code
Sep 08, 2025
Viaarxiv icon

An Agentic Model Context Protocol Framework for Medical Concept Standardization

Add code
Sep 04, 2025
Viaarxiv icon